9 research outputs found

MACHINE LEARNING APPROACHES FOR BIOMARKER IDENTIFICATION AND SUBGROUP DISCOVERY FOR POST-TRAUMATIC STRESS DISORDER

Author: Lu Liangqun
Publication venue: University of Memphis Digital Commons
Publication date: 01/01/2020

Post-traumatic stress disorder (PTSD) is a psychiatric disorder caused by environmental and genetic factors resulting from alterations in genetic variation, epigenetic changes and neuroimaging characteristics. There is a pressing need to identify reliable molecular and physiological biomarkers for accurate diagnosis, prognosis, and treatment, as well as to deepen the understanding of PTSD pathophysiology. Machine learning methods are widely used to infer patterns from biological data, identify biomarkers, and make predictions. The objective of this research is to apply machine learning methods for the accurate classification of human diseases from genome-scale datasets, focusing primarily on PTSD. The DoD-funded Systems Biology of PTSD Consortium has recruited combat veterans with and without PTSD for measurement of molecular and physiological data from blood or urine samples, with the goal of identifying accurate and specific PTSD biomarkers. As a member of the Consortium with access to these PTSD multiple omics datasets, we first completed a project titled Clinical Subgroup-Specific PTSD Classification and Biomarker Discovery. We applied machine learning approaches to these data to build classification models consisting of molecular and clinical features to predict PTSD status. We also identified candidate biomarkers for diagnosis, which improves our understanding of PTSD pathogenesis. In a second project, entitled Multi-Omic PTSD Subgroup Identification and Clinical Characterization, we applied methods for integrating multiple omics datasets to investigate the complex, multivariate nature of the biological systems underlying PTSD. Using two different machine learning approaches on 82 PTSD-positive samples, we identified an optimal number of two PTSD subgroups, and we found that the subgroups exhibited different remitting behavior as inferred from subjects recalled at a later time point. The results from our association, differential expression, and classification analyses demonstrated the distinct clinical and molecular features characterizing these subgroups. Taken together, our work has advanced our understanding of PTSD biomarkers and subgroups through the use of machine learning approaches. Results from our work should strongly contribute to the precise diagnosis and eventual treatment of PTSD, as well as other diseases. Future work will involve continuing to leverage these results to enable precision medicine for PTSD.

University of Memphis Digital Commons

Multi-Omic Data Integration to Stratify Population in Hepatocellular Carcinoma

Author: Lu Liangqun
Publication venue: [Honolulu] : [University of Hawaii at Manoa], [August 2016]
Publication date: 01/08/2016

M.S. University of Hawaii at Manoa 2016. Includes bibliographical references.

ScholarSpace at University of Hawai'i at Manoa

Prognostic analysis of histopathological images using pre-trained convolutional neural networks: Application to hepatocellular carcinoma

Author: Daigle Bernie J.
Lu Liangqun
Publication venue: 'PeerJ'
Publication date: 01/01/2020

Histopathological images contain rich phenotypic descriptions of the molecular processes underlying disease progression. Convolutional neural networks (CNNs), state-of-the-art image analysis techniques in computer vision, automatically learn representative features from such images, which can be useful for disease diagnosis, prognosis, and subtyping. Hepatocellular carcinoma (HCC) is the most common type of primary liver malignancy. Despite the high mortality rate of HCC, little previous work has used CNN models to explore histopathological images for prognosis and clinical survival prediction of HCC. We applied three pre-trained CNN models (VGG 16, Inception V3 and ResNet 50) to extract features from HCC histopathological images. Sample visualization and classification analyses based on these features showed a very clear separation between cancer and normal samples. In a univariate Cox regression analysis, 21.4% and 16% of image features on average were significantly associated with overall survival (OS) and disease-free survival (DFS), respectively. We also observed significant correlations between these features and integrated biological pathways derived from gene expression and copy number variation. Using an elastic net regularized Cox Proportional Hazards model of OS constructed from Inception image features, we obtained a concordance index (C-index) of 0.789 and a significant log-rank test (p = 7.6E-18). We also performed unsupervised classification to identify HCC subgroups from image features. The optimal two subgroups discovered using Inception model image features showed significant differences in both OS (C-index = 0.628 and p = 7.39E-07) and DFS (C-index = 0.558 and p = 0.012).
Our work demonstrates the utility of extracting image features using pre-trained models, both by using them to build accurate prognostic models of HCC and by highlighting significant correlations between these features, clinical survival, and relevant biological pathways. Image features extracted from HCC histopathological images using the pre-trained CNN models VGG 16, Inception V3 and ResNet 50 can accurately distinguish normal and cancer samples. Furthermore, these image features are significantly correlated with survival and relevant biological pathways.
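
The abstract above reports concordance index (C-index) values for its survival models. As a rough illustration of what that statistic measures, the following is a minimal sketch of Harrell's C-index in plain Python. It is not the authors' implementation; the function name `concordance_index` and the toy data are assumptions made for this example only.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's C-index: share of comparable pairs whose predicted risk
    ordering agrees with the observed survival ordering.

    times       -- observed follow-up times
    events      -- 1 if the event (e.g. death) was observed, 0 if censored
    risk_scores -- model output; a higher score means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let `a` be the subject with the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the times differ and the subject
        # with the shorter time actually experienced the event.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # higher risk failed earlier: agreement
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical, not from the paper): 4 of 5 comparable pairs agree.
toy_c = concordance_index(
    times=[5, 10, 15, 20],
    events=[1, 1, 0, 1],
    risk_scores=[0.9, 0.4, 0.5, 0.1],
)
print(toy_c)  # 0.8
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which gives some context for the reported values of 0.789, 0.628 and 0.558.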

University of Memphis Digital Commons

Directory of Open Access Journals

GEOlimma: differential expression analysis and feature selection using pre-existing microarray data

Author: Daigle Bernie J.
Lu Liangqun
Townsend Kevin A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2021

Background: Differential expression and feature selection analyses are essential steps for the development of accurate diagnostic/prognostic classifiers of complicated human diseases using transcriptomics data. These steps are particularly challenging due to the curse of dimensionality and the presence of technical and biological noise. A promising strategy for overcoming these challenges is the incorporation of pre-existing transcriptomics data in the identification of differentially expressed (DE) genes. This approach has the potential to improve the quality of selected genes, increase classification performance, and enhance biological interpretability. While a number of methods have been developed that use pre-existing data for differential expression analysis, existing methods do not leverage the identities of experimental conditions to create a robust metric for identifying DE genes. Results: In this study, we propose a novel differential expression and feature selection method, GEOlimma, which combines pre-existing microarray data from the Gene Expression Omnibus (GEO) with the widely applied Limma method for differential expression analysis. We first quantified differential gene expression across 2481 pairwise comparisons from 602 curated GEO Datasets, and we converted differential expression frequencies to DE prior probabilities. Genes with high DE prior probabilities were enriched in cell growth and death, signal transduction, and cancer-related biological pathways, while genes with low prior probabilities were enriched in sensory system pathways. We then applied GEOlimma to four differential expression comparisons within two human disease datasets and performed differential expression, feature selection, and supervised classification analyses. Our results suggest that use of GEOlimma provides greater experimental power to detect DE genes compared to Limma, due to its increased effective sample size.
Furthermore, in a supervised classification analysis using GEOlimma as a feature selection method, we observed similar or better classification performance than Limma given small, noisy subsets of an asthma dataset. Conclusions: Our results demonstrate that GEOlimma is a more effective method for differential gene expression and feature selection analyses compared to the standard Limma method. Due to its focus on gene-level differential expression, GEOlimma also has the potential to be applied to other high-throughput biological datasets.

University of Memphis Digital Commons

Multi-omic biomarker identification and validation for diagnosing warzone-related post-traumatic stress disorder

Author: Abu-Amara Duna
Almli Lynn M.
Bersani F. Saverio
Chakraborty Nabarun
Daigle Bernie J.
Dean Kelsey R.
Donohue Duncan
Doyle III Francis J.
Flory Janine D.
Gautam Aarti
Guffanti Guia
Hammamieh Rasha
Hood Leroy
Jett Marti
Kerley Kimberly
Kim Taek Kyun
Laska Eugene
Lee Inyoul
Lindqvist Daniel
Lori Adriana
Lu Liangqun
Marmar Charles
Mellon Synthia H.
Misganaw Burook
Muhie Seid
Newman Jennifer
Price Nathan D.
Qin Shizhen
Ressler Kerry J.
Reus Victor I.
Siegel Carole
Somvanshi Pramod R.
Thakur Gunjan S.
The PTSD Systems Biology Consortium
Wang Kai
Wolkowitz Owen M.
Yang Ruoting
Yehuda Rachel
Young Lee Min
Zhou Yong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/09/2019

Post-traumatic stress disorder (PTSD) impacts many veterans and active duty soldiers, but diagnosis can be problematic due to biases in self-disclosure of symptoms, stigma within military populations, and limitations in identifying those at risk. Prior studies suggest that PTSD may be a systemic illness, affecting not just the brain, but the entire body. Therefore, disease signals likely span multiple biological domains, including genes, proteins, cells, tissues, and organism-level physiological changes. Identification of these signals could aid in diagnostics, treatment decision-making, and risk evaluation. In the search for PTSD diagnostic biomarkers, we ascertained over one million molecular, cellular, physiological, and clinical features from three cohorts of male veterans. In a discovery cohort of 83 warzone-related PTSD cases and 82 warzone-exposed controls, we identified a set of 343 candidate biomarkers. These candidate biomarkers were selected using an integrated approach combining (1) data-driven methods, including Support Vector Machine with Recursive Feature Elimination and other standard or published methodologies, and (2) hypothesis-driven approaches, using previous genetic studies for polygenic risk, or other PTSD-related literature. After reassessment of ~30% of these participants, we refined this set of markers from 343 to 28, based on their performance and ability to track changes in phenotype over time. The final diagnostic panel of 28 features was validated in an independent cohort (26 cases, 26 controls) with good performance (AUC = 0.80, 81% accuracy, 85% sensitivity, and 77% specificity). The identification and validation of this diverse diagnostic panel represents a powerful and novel approach to improve accuracy and reduce bias in diagnosing combat-related PTSD.
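
The validation figures in the abstract above (26 cases, 26 controls, 81% accuracy, 85% sensitivity, 77% specificity) can be related to one another with a short calculation. The sketch below is illustrative only: the abstract does not report a confusion matrix, so the counts used here (22 true positives, 20 true negatives) are an assumption, chosen because they reproduce the reported percentages after rounding. The function name `panel_metrics` is likewise hypothetical.

```python
def panel_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                 # share of cases correctly flagged
    specificity = tn / (tn + fp)                 # share of controls correctly cleared
    accuracy = (tp + tn) / (tp + fn + tn + fp)   # share of all subjects classified correctly
    return sensitivity, specificity, accuracy


# ASSUMED counts, not taken from the source: 26 cases split 22/4,
# 26 controls split 20/6.
sens, spec, acc = panel_metrics(tp=22, fn=4, tn=20, fp=6)
print(f"{sens:.0%} sensitivity, {spec:.0%} specificity, {acc:.0%} accuracy")
# prints "85% sensitivity, 77% specificity, 81% accuracy"
```

With balanced groups of equal size, accuracy is simply the mean of sensitivity and specificity, which is why the three reported figures fit together.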

Lund University Publications

Crossref

eScholarship - University of California

Providence St. Joseph Health Digital Commons

Author: Acemoglu
Ahern
Ashley
Avner Greif
Bai
Ballard
Banfield
Barber
Ben-Amos
Beresford
Berman
Bernhofen
Bian
Boerner
Bourgon
Brenner
Cantoni
Chen
Ch’u
Clark
Cohen
Cordery
Davis
Dawson
de Moor
de Roover
Deng
Dixit
Dollinger
Drew
Duby
Duffy
Ebrey
Ekelund
English Historical Documents
Faure
Fei
Fisher
Folena
Freedman
Friedmann
Gelderblom
Gellhorn
González de Lara
Goody
Gorodnichenko
Greif
Guichard
Guido Tabellini
Guinnane
Hartwell
Heijdra
Herlihy
Huang
Huen
Hughes
Hung
Jha
Jűtte
Kelly
Kessler
King
Kiser
Kuhn
Kulp
Kumar
Kuran
Kuroda
Landa
Lary
Laslett
Li
Liangqun
Liu
Lu
Ma
Mann
Mitterauer
Mok
Nader
Nakamura
Nee
Nicholas
North
Nunn
Pasternak
Peng
Perkins
Pirenne
Pomeranz
Postan
Powell
Pyatt
Razi
Redding
Reed
Reynolds
Reynold’s
Richardson
Rosenthal
Rowe
Rozman
Ruskola
Schofield
Shiue
Smith
Sng
Sommerville
Strum
Stubbs
Su
Szonyi
Tabellini
Tait
Telford
Thøgersen
Tilly
Trenerry
Tsai
Van Doosselaere
Van Leeuwen
Voigtländer
Waley
Watson
Whyte
Zhang
Zhenman
Publication venue: 'Elsevier BV'

Crossref

Proceedings of the 16th Annual UT-KBRIN Bioinformatics Summit 2016: bioinformatics

Proceedings of the 16th Annual UT-KBRIN Bioinformatics Summit 2016: bioinformatics

The clan and the corporation: Sustaining cooperation in China and Europe